Non-definability of languages by generalized 
first-order formulas over (N,+) 



Andreas Krebs 1 and A V Sreejith 2 

1 Wilhelm-Schickard Institut, Universität Tübingen, Germany 
2 Institute of Mathematical Sciences, Chennai, India 



Abstract. We consider first-order logic with monoidal quantifiers over 
words. We show that all languages with a neutral letter that are definable 
using the addition predicate are also definable with the order predicate as 
the only numerical predicate. Let $S$ be a set of finite monoids, let 
$\mathcal{L}_S$ be the logic closed under quantification over the monoids in 
$S$, and let $\mathcal{N}$ be the class of neutral letter languages. Then we 
prove that 

$\mathcal{L}_S[<,+] \cap \mathcal{N} = \mathcal{L}_S[<] \cap \mathcal{N}$. 

Our result can be interpreted as saying that the Crane Beach conjecture holds 
for the logic $\mathcal{L}_S[<,+]$. As a consequence we get the result of Roy and 
Straubing that FO+MOD[<,+] collapses to FO+MOD[<]. For cyclic 
groups, we answer an open question of Roy and Straubing, proving that 
MOD[<,+] collapses to MOD[<]. Our result also shows that multiplication 
as a numerical predicate is necessary for Barrington's theorem to 
hold and also to simulate majority quantifiers. 

All these results can be viewed as separation results for highly uniform 
circuit classes. For example, we separate FO[<,+]-uniform CC$^0$ from 
FO[<,+]-uniform ACC$^0$. 



1 Introduction 

Consider a language with a "neutral letter", i.e. a letter which can be inserted 
or deleted from any word in the language without changing its membership. The 
neutral letter concept has turned out to be useful for showing non-expressibility 
results. It has been used to establish superlinear lower bounds for 
bounded-width branching programs [4] and for the number of wires in circuit 
classes [12]; it also led to results in communication complexity [9]. But 
mostly the concept is known in the context of the Crane Beach conjecture 
proposed in [2]. There it was conjectured that first-order logic with arbitrary 
numerical predicates (denoted arb) collapses to first-order logic with only the 
linear ordering in the presence of a neutral letter. The idea is that, in the 
presence of a neutral letter, formulas cannot rely on the precise location of 
input letters and hence numerical predicates will be of little use. Let 
$\mathcal{N}$ denote the class of languages with neutral letters. Let $S$ be a 
set of finite monoids and $\mathcal{L}_S$ be the logic closed under quantification, 
where the quantifiers are Lindström quantifiers over some monoid from $S$. Then 
the Crane Beach conjecture says that 

$\mathcal{L}_S[\mathrm{arb}] \cap \mathcal{N} = \mathcal{L}_S[<] \cap \mathcal{N}$. 

The conjecture was refuted by Barrington et al. [2], where they showed that it 
does not hold for the logic FO[<,+,*], i.e. first-order logic ($S$ consists of 
only the existential quantifier) using the addition and multiplication 
relations. In the same paper, the authors proved that the conjecture holds for 
various other logics. The Boolean closure of the $\Sigma_1$-fragment of FO[arb] 
satisfies the conjecture; that is, 
$\mathcal{B}(\Sigma_1)[\mathrm{arb}] \cap \mathcal{N} = \mathcal{B}(\Sigma_1)[<] \cap \mathcal{N}$. 
Lautemann, Tesson and Thérien [16] considered quantifiers which can count 
modulo a prime $p$ (called MOD$_p$). They obtain that 
$\mathcal{B}(\Sigma_1^{(s,p)})[\mathrm{arb}] \cap \mathcal{N} = \mathcal{B}(\Sigma_1^{(s,p)})[<] \cap \mathcal{N}$, 
which is equivalent to MOD$_p$[arb] $\cap\ \mathcal{N}$ = MOD$_p$[<] $\cap\ \mathcal{N}$. 

Benedikt and Libkin [8], in the context of collapse results in database theory, 
showed that first-order logic with only addition satisfies the Crane Beach 
conjecture. A different proof of the result can be found in [2]. We generalize 
this result to arbitrary monoidal quantifiers. Let $S$ be a set of finite 
monoids. Consider the logic $\mathcal{L}_S$ where the quantifiers are Lindström 
quantifiers whose languages are word problems of monoids in $S$. Our main result 
(Theorem 2) is that the Crane Beach conjecture holds for the logic 
$\mathcal{L}_S[<,+]$; that is, 

$\mathcal{L}_S[<,+] \cap \mathcal{N} = \mathcal{L}_S[<] \cap \mathcal{N}$. 

If $S$ consists of aperiodic monoids, then the theorem is equivalent to the 
result of Benedikt and Libkin. For solvable monoids, Roy and Straubing [22] 
used ideas of Benedikt and Libkin to show that in the presence of neutral 
letters FO+MOD[<,+] collapses to FO+MOD[<]. In their paper they raised the 
question: does MOD[<,+] satisfy the Crane Beach conjecture? Our main theorem 
answers this question affirmatively. 

Our results can also be viewed from the perspective of descriptive complexity 
of circuit classes. The books [11, ?] present the close connection between logics 
with monoid quantifiers and circuit classes. We know that the set of languages 
accepted by uniform-AC$^0$ circuits are exactly those definable by first-order 
logic using the order, addition and multiplication relations. Similarly CC$^0$ 
(constant depth, polynomial size circuits with MOD-gates) corresponds to 
MOD[<,+,*], ACC$^0$ corresponds to FO+MOD[<,+,*], TC$^0$ corresponds to 
MAJ[<,+,*], and NC$^1$ corresponds to GROUP[<,+,*] (the "group quantifier" 
evaluates over a finite group). It is a well known result that AC$^0$ is 
separated from ACC$^0$ [10], but relationships between most other classes are 
open. For example, we do not know whether CC$^0$ is different from ACC$^0$. In 
fact we do not know whether MOD$_6$[<,+,*] contains uniform-AC$^0$. This explains 
why the Crane Beach conjecture for prime modulo quantifiers [16], using 
arbitrary predicates, cannot be easily extended to composite modulo quantifiers. 

We look at these separation questions from the descriptive complexity 
perspective. As a first step, one can ask the question of separating the logics 
without the multiplication relation. That is, can one separate MOD[<,+] from 
FO+MOD[<,+]? Is GROUP[<,+] different from FO+MOD[<,+]? 

Behle and Lange [7] gave a notion of interpreting $\mathcal{L}_S[<,+]$ as highly 
uniform circuit classes. Our results can therefore be summarized as: every 
FO[<,+]-uniform constant depth, polynomial size circuit with gates that compute 
a product in $S$ and that recognizes a language with a neutral letter can be 
made FO[<]-uniform. 

As a consequence of our main theorem we are able to separate these uniform 
versions of circuit classes. For example, the theorem states that 
MOD[<,+]-definable languages with a neutral letter are also definable in MOD[<]. 
Since MOD[<] cannot simulate the existential quantifier [26], we have that 
FO[<,+] and MOD[<,+] are incomparable. In fact we show that no group quantifier 
can simulate the existential quantifier if only addition is available. This 
gives an alternate proof of the known result [22] that FO+MOD$_m$[<,+] cannot 
count modulo a prime $p$, where $p$ does not divide $m$. Another consequence is 
that the majority quantifier cannot be simulated by group quantifiers if 
multiplication is not available, thus separating MAJ[<,+] from FO+GROUP[<,+]. 
Barrington's theorem [1] says that word problems over any finite group can be 
defined by the logic which uses only the $S_5$ group quantifier (the group whose 
elements are all permutations of 5 elements) if the addition and multiplication 
predicates are available. Our result shows that multiplication is necessary for 
Barrington's theorem to hold. In other words, $S_5$ cannot define word problems 
over $S_6$ if only addition is available. 

Non-expressibility results for various logics which use addition and a variety 
of quantifiers have been considered earlier. Lynch [19] proved that FO[<,+] 
cannot count modulo any number. Nurmonen [21] and Niwiński and Stolboushkin 
[25] looked at logics with counting quantifiers equipped with numerical 
predicates of the form $y = px$ and a linear ordering. Ruhl [23], Schweikardt 
[24], Lautemann et al. [15], and Lange [14] all showed the limited expressive 
power of addition in the presence of majority quantifiers. Behle, Krebs and 
Reifferscheid [6, 5] proved that non-solvable groups are not definable in the 
two-variable fragment of MAJ[<]. 

For the purpose of the proof we work over infinite strings which contain a 
finite number of non-neutral letters. Our general proof strategy is similar to 
Benedikt and Libkin [8] or Roy and Straubing [22] and consists of three main 
steps. 

1. Given a formula $\phi \in \mathcal{L}_S[<,+]$, we give an infinite set 
$\mathcal{D} \subseteq \mathbb{N}$ and an "active domain formula" 
$\phi' \in \mathcal{L}_S[<,+]$ such that for all words $w$ whose non-neutral 
positions belong to $\mathcal{D}$ we have $w \models \phi \Leftrightarrow w \models \phi'$. 
Active domain formulas quantify only over non-neutral letter positions. Our 
major contribution (Theorem 17) is showing this step. 

2. We give another infinite set $\mathcal{T} \subseteq \mathcal{D}$ and an active 
domain formula $\psi \in \mathcal{L}_S[<]$ such that for all words $w$ whose 
non-neutral positions belong to $\mathcal{T}$ we have 
$w \models \phi' \Leftrightarrow w \models \psi$. This step follows from an 
application of Ramsey theory (Theorem 18). 

3. All active domain formulas in $\mathcal{L}_S[<]$ accept languages with a 
neutral letter. This is an easy observation given by Lemma 19. 



Finally using these three steps we prove our main theorem. 

The main step is to build an active domain formula. Hence we need to show 
how to simulate a quantifier by an active domain formula. In the case of FO[<,+], 
the quantifiers, considered as Lindström quantifiers, have a commutative and 
idempotent monoid. Hence the order in which the quantifier runs over the 
positions of the word is unimportant, and it does not matter if positions are 
queried multiple times. In Roy and Straubing [22] this idea was extended in such 
a way that in the simulation of the MOD quantifier (again a commutative monoid), 
every position is taken into account exactly once. In their construction, while 
replacing a MOD quantifier they need to add additional FO quantifiers, and 
hence their construction only allows replacing a MOD[<,+] formula by an 
active domain FO+MOD[<,+] formula. In this paper, we construct a formula 
that takes every position into account exactly once and in the correct order. 
Moreover we do not introduce any new quantifier, but use only the quantifier 
that is replaced. This enables us to obtain the Crane Beach conjecture for logics 
whose quantifiers have a non-commutative monoid or are groups, for example 
MOD[<,+], GROUP[<,+], and FO+GROUP[<,+]. 

In contrast to previous work, we do not construct an equivalent active domain 
formula, but only a formula that is equivalent for certain domains. We show that 
it is in general sufficient to show this for one infinite domain. We also introduce 
a combinatorial structure called Sorting Tree which can be of interest on its 
own. Yet another contribution is to use inverse elements of groups to merge two 
sorted lists of numbers. 

We present our main theorem and its corollaries in Section 3, followed by a 
section with the proof of Theorem 17. Our main contribution is Section 5. There 
we replace group quantifiers by their active domain versions. 



2 Preliminaries 

An alphabet $\Sigma$ is a finite set of symbols. The set of all finite words 
over $\Sigma$ is denoted by $\Sigma^*$, the set of all right infinite words is 
denoted by $\Sigma^\omega$. Let $\Sigma^\infty = \Sigma^* \cup \Sigma^\omega$. 
Consider a language $L \subseteq \Sigma^\infty$ and a letter $\lambda \in \Sigma$. 
We say that $\lambda$ is a neutral letter for $L$ if for all 
$u, v \in \Sigma^\infty$ we have that $u\lambda v \in L \Leftrightarrow uv \in L$. 
We denote the set of all languages with a neutral letter by $\mathcal{N}$. 

For a word $w \in \Sigma^\infty$ the notation $w(i)$ denotes the $i$th letter in 
$w$, i.e. $w = w(0)w(1)w(2)\cdots$. For a word $w$ in a language $L$ with neutral 
letter $\lambda$, we define the non-neutral positions $\mathrm{nnp}(w)$ of $w$ to 
be the set of all positions which do not have the neutral letter. 
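
As a small illustration of these definitions, the following Python sketch computes $\mathrm{nnp}(w)$ for a finite word and deletes its neutral letters. The function names and the choice of `_` as the neutral letter are assumptions made for this example only; they do not appear in the paper.

```python
def nnp(word, neutral):
    """Return the set of positions of `word` that do not carry the neutral letter."""
    return {i for i, letter in enumerate(word) if letter != neutral}


def strip_neutral(word, neutral):
    """Delete every occurrence of the neutral letter from `word`."""
    return "".join(letter for letter in word if letter != neutral)


if __name__ == "__main__":
    w = "a__b_a"                    # positions are counted from 0, as in w(0)w(1)...
    print(sorted(nnp(w, "_")))      # [0, 3, 5]
    print(strip_neutral(w, "_"))    # aba
```

If `_` is neutral for a language, then a word and its stripped version are either both in the language or both outside it.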

A monoid is a set closed under a binary associative operation and has an 
identity element. All monoids we consider except for $\Sigma^*$ and 
$\Sigma^\infty$ will be finite. A monoid $M$ and $S \subseteq M$ define a word 
problem. Its language is composed of words $w \in M^*$ such that when the 
elements of $w$ are multiplied in order we get an element in $S$. We say that a 
monoid $M$ divides a monoid $N$ if there exists a submonoid $N'$ of $N$ and a 
surjective morphism from $N'$ to $M$. A monoid $M$ recognizes a language 
$L \subseteq \Sigma^*$ if there exists a morphism $h : \Sigma^* \to M$ and a 
subset $T \subseteq M$ such that $L = h^{-1}(T)$. It is known that finite monoids 
recognize exactly the regular languages [26]. We denote by $\mathcal{M}$ the set 
of all finite monoids, by $\mathcal{G} \subseteq \mathcal{M}$ the set of all 
finite groups and by MOD the set of all finite cyclic groups. We denote by $U_1$ 
the monoid consisting of the elements $\{0, 1\}$ under multiplication. For a 
monoid $M$, the element $1 \in M$ will denote its identity element. We also use 
the block product of monoids, whose definition can be found in [26]. For a set 
$S$ of monoids, bpc($S$) denotes the smallest set which contains $S$ and is 
closed under block products. 
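
The notion of a word problem can be made concrete with a short Python sketch. It multiplies the letters of a word in order and tests whether the product lies in the accepting subset. The helper names, and the choice of the cyclic group with 3 elements as a second example, are assumptions of this illustration.

```python
from functools import reduce


def in_word_problem(word, multiply, identity, accepting):
    """Multiply the letters of `word` in order and test membership in `accepting`."""
    product = reduce(multiply, word, identity)
    return product in accepting


def u1_mul(a, b):
    # U_1 = ({0, 1}, *) with identity element 1.
    return a * b


def c3_mul(a, b):
    # Cyclic group with 3 elements, written additively, identity element 0.
    return (a + b) % 3


if __name__ == "__main__":
    print(in_word_problem([1, 1, 0, 1], u1_mul, 1, {0}))  # True: the word contains a 0
    print(in_word_problem([1, 2, 1], c3_mul, 0, {0}))     # False: 1 + 2 + 1 = 4 = 1 mod 3
```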

Given a formula $\phi$ with free variables $x_1, \dots, x_k$, we write 
$w, i_1, \dots, i_k \models \phi$ if $w$ is a model for the formula $\phi$ when 
the free variable $x_j$ is assigned to $i_j$ for $j = 1, \dots, k$. We abuse 
notation and let $c \in \Sigma$ also be the unary predicate symbols of the logic 
we consider. That is, $w, i \models c(x)$ iff $w(i) = c$. Let $V$ be a set of 
variables, $\mathcal{R}$ be a set of numerical predicates and 
$S \subseteq \mathcal{M}$. We define the logic $\mathcal{L}_S[\mathcal{R}]$ to be 
built from the unary predicate symbols $c$, where $c \in \Sigma$, the binary 
predicate $\{=\}$, the predicates in $\mathcal{R}$, the variable symbols $V$, the 
Boolean connectives $\{\neg, \vee, \wedge\}$, and the monoid quantifiers $Q_M^m$, 
where $M \in S$ is a monoid and $m \in M$. We also identify the logic class 
$\mathcal{L}_S[\mathcal{R}]$ with the set of all languages definable in it. 

Our definition of monoid quantifiers is a special case of Lindström quantifiers 
[18]. The formal definition of a monoid quantifier [3] is as follows. Let 
$M = \{m_1, \dots, m_K, 1\}$ be a monoid with $K+1$ elements. For an $m \in M$, 
the quantifier $Q_M^m$ is applied on $K$ formulas. Let $x$ be a free variable and 
$\phi_1(x), \dots, \phi_K(x)$ be $K$ formulas. Then 
$w \models Q_M^m x (\phi_1(x), \dots, \phi_K(x))$ iff the word $u$ when multiplied 
gives the element $m$, i.e. $\prod_i u(i) = m$, where the $i$th letter of $u$, 
$0 \le i < |w|$, is 

$$u(i) = \begin{cases}
m_1 & \text{if } w, i \models \phi_1 \\
m_2 & \text{if } w, i \models \neg\phi_1 \wedge \phi_2 \\
\vdots \\
m_K & \text{if } w, i \models \neg\phi_1 \wedge \cdots \wedge \neg\phi_{K-1} \wedge \phi_K \\
1 & \text{otherwise}
\end{cases}$$

The following "shorthand" notation is used to avoid clutter. We denote by 
$Q_M^m x\ \phi\ (\alpha_1, \dots, \alpha_K)$ the formula 
$Q_M^m x (\phi \wedge \alpha_1, \dots, \phi \wedge \alpha_K)$. Informally, this 
relativizes the quantifier to the positions where $\phi$ is true, by multiplying 
the neutral element in all other places. 
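
The semantics of a monoid quantifier on a finite word can be sketched in Python as follows. This is only an illustration under stated assumptions: formulas are modelled as Python functions of the word and a position, the monoid is passed as a multiplication function, and all names are hypothetical.

```python
def eval_monoid_quantifier(word, formulas, elements, multiply, identity, target):
    """Evaluate Q_M^m x (phi_1, ..., phi_K) on a finite word.

    formulas[k](word, i) decides whether w, i |= phi_{k+1};
    elements[k] is the monoid element m_{k+1}; `target` is m.
    """
    product = identity
    for i in range(len(word)):
        letter = identity                    # the "otherwise" case
        for phi, m in zip(formulas, elements):
            if phi(word, i):                 # the first formula that holds decides u(i)
                letter = m
                break
        product = multiply(product, letter)  # multiply in increasing position order
    return product == target


def is_a(word, i):
    return word[i] == "a"


def add_mod_2(x, y):
    return (x + y) % 2


if __name__ == "__main__":
    # Quantifier over the cyclic group with 2 elements: "the number of a's is odd".
    print(eval_monoid_quantifier("abaa", [is_a], [1], add_mod_2, 0, 1))  # True
```

Scanning the formulas in order and stopping at the first one that holds mirrors the case distinction in the definition, where $m_k$ is chosen only if $\phi_1, \dots, \phi_{k-1}$ all fail.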

Consider the monoid $U_1$. It is easy to see that the word problem defined by 
$U_1$ and the set $\{0\}$ defines the regular language $1^*0(0+1)^*$. Then 
$Q_{U_1}^0$ is the same as the existential quantifier $\exists$, since any 
formula of the form $\exists x\, \phi$ is equivalent to $Q_{U_1}^0 x (\phi)$. So 
the logic $\mathcal{L}_{U_1}[<]$ denotes first-order logic, FO[<]. Let $C_q$ 
stand for the cyclic group with $q$ elements. Then the quantifiers $Q_{C_q}$ 
correspond to modulo quantifiers [28]. Thus $\mathcal{L}_{\mathrm{MOD}}[<]$ 
corresponds to all regular languages whose syntactic monoids are solvable 
groups [26]. For a sentence $\phi \in \mathcal{L}_S[\mathcal{R}]$ we define 
$L(\phi) = \{w \mid w \models \phi\}$. The following result gives an algebraic 
characterization for the logic $\mathcal{L}_S[<]$. 



Lemma 1 ([26]). Let $S \subseteq \mathcal{M}$. Let $L \subseteq \Sigma^*$ such 
that $M$ is the smallest monoid which recognizes $L$. Then $L$ is definable in 
$\mathcal{L}_S[<]$ iff $M$ divides a monoid in bpc($S$). 



3 Results 

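Let $S \subseteq \mathcal{M}$ be any set of monoids. We show that the Crane Beach 
conjecture is true for the logic $\mathcal{L}_S[<,+]$. 

Theorem 2 (Main Theorem). Let $S \subseteq \mathcal{M}$. Then 

$\mathcal{L}_S[<,+] \cap \mathcal{N} = \mathcal{L}_S[<] \cap \mathcal{N}$ 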

The proof of this theorem is given in Section 4. 

3.1 Non definability Results 

Theorem 2 gives us the following corollaries. 

Corollary 3. All languages with a neutral letter in 
$\mathcal{L}_{\mathcal{M}}[<,+]$ are regular. 

Proof. By Theorem 2 we know that all languages with a neutral letter in 
$\mathcal{L}_{\mathcal{M}}[<,+]$ can be defined in $\mathcal{L}_{\mathcal{M}}[<]$, 
which by Lemma 1 is the set of all regular languages. □ 

Recall that a monoid M divides a monoid N if M is a morphic image of a 
submonoid of N. 

Corollary 4. Let $S \subseteq \mathcal{G}$. Let $G$ be a simple group that does 
not divide any monoid $M$ in $S$. Then the word problem over $G$ is not definable 
in $\mathcal{L}_S[<,+]$. 

Proof. The word problem over $G$ has a neutral letter. The result now follows 
from Theorem 2 and Lemma 1. □ 

The majority quantifier $\mathrm{Maj}\, x\, \phi(x)$ is given as follows. 

$w \models \mathrm{Maj}\, x\, \phi(x) \Leftrightarrow |\{ i \mid w \models \phi(i),\ i < |w| \}| > \frac{|w|}{2}$ 

MAJ[<] denotes the logic closed under majority quantifiers. It is known that 
the majority quantifier can be simulated by a non-solvable group if both 
multiplication and addition are available [29]. We show that multiplication is 
necessary to simulate majority quantifiers. 

Corollary 5. MAJ[<] $\not\subseteq \mathcal{L}_{\mathcal{M}}[<,+]$. 



Proof. Consider the language $L \subseteq \{a, b, c\}^*$ consisting of all words 
with an equal number of a's and b's. $L$ can be proven to be definable in MAJ[<]. 
Also note that c is a neutral letter for $L$. By Corollary 3, and the fact that 
$L$ is nonregular, we know that $L$ is not definable in 
$\mathcal{L}_{\mathcal{M}}[<,+]$. □ 
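
The claim that c is a neutral letter for this language can be checked by brute force on short words. The Python sketch below does this for all words up to a given length; it is a finite sanity check with hypothetical function names, not a proof.

```python
from itertools import product


def equal_ab(word):
    """Membership test for L: equally many a's and b's."""
    return word.count("a") == word.count("b")


def c_is_neutral_up_to(max_len):
    """Check that inserting one c anywhere never changes membership in L."""
    for n in range(max_len + 1):
        for letters in product("abc", repeat=n):
            w = "".join(letters)
            for cut in range(n + 1):
                if equal_ab(w[:cut] + "c" + w[cut:]) != equal_ab(w):
                    return False
    return True


if __name__ == "__main__":
    print(c_is_neutral_up_to(5))  # True
```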

Barrington's theorem [1] says that the word problem of any finite group can 
be defined in the logic $\mathcal{L}_{S_5}[<,+,*]$. The following corollary shows 
that multiplication is necessary for Barrington's theorem to hold. 

Corollary 6. The word problem over the group $S_6$ is not definable in 
$\mathcal{L}_{S_5}[<,+]$. In fact there does not exist any one finite monoid $M$ 
such that all regular languages can be defined in $\mathcal{L}_M[<,+]$. 

Proof. $A_6$ is a simple subgroup of $S_6$, which does not divide $S_5$. From 
Corollary 4 it follows that the word problem over $S_6$ is not definable in 
$\mathcal{L}_{S_5}[<,+]$. For any finite monoid $M$, there exists a simple group 
$G$ such that $G$ does not divide $M$ and hence the word problem over $G$ is not 
definable in $\mathcal{L}_M[<,+]$. □ 
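
One way to see that $A_6$ does not divide $S_5$ is a cardinality count: a divisor of a finite monoid is a quotient of a submonoid and therefore has at most as many elements as the monoid itself. The Python sketch below only carries out this arithmetic. The argument and the helper names are assumptions of this illustration; the source does not say which argument the authors had in mind.

```python
from math import factorial


def order_symmetric(n):
    """Number of permutations of n elements."""
    return factorial(n)


def order_alternating(n):
    """Number of even permutations of n elements (valid for n >= 2)."""
    return factorial(n) // 2


if __name__ == "__main__":
    print(order_alternating(6), order_symmetric(5))  # 360 120
```

Since 360 exceeds 120, no submonoid of $S_5$ can map onto $A_6$.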

Let $L_p$ be the set of all words $w \in \{0, 1\}^*$ such that the number of 
occurrences of 1 in $w$ is congruent to $0 \pmod p$. Then we get the result in 
[22] that $L_p$ is not definable in FO+MOD$_m$[<,+] if $p$ is a prime which does 
not divide $m$. 

Corollary 7 ([22]). If $p$ is a prime which does not divide $m$, then $L_p$ is 
not definable in FO+MOD$_m$[<,+]. 

Proof. Let $L_p$ be definable in FO+MOD$_m$[<,+]. Since 0 is a neutral letter in 
$L_p$, Theorem 2 says $L_p$ is also definable in FO+MOD$_m$[<]. Due to Lemma 1 
and [26], this is a contradiction. □ 

It is an open conjecture whether the language $1^*$ can be accepted by the 
circuit complexity class CC$^0$ [26]. It is also known that languages accepted 
by CC$^0$ circuits are exactly those which are definable by 
$\mathcal{L}_{\mathrm{MOD}}[<,+,*]$ formulas [29]. 

To progress in this direction Roy and Straubing [22] had posed the question 
of whether $1^* \notin \mathcal{L}_{\mathrm{MOD}}[<,+]$. Below we show that this 
is the case. 

Corollary 8. $1^* \notin \mathcal{L}_{\mathrm{MOD}}[<,+]$. In fact 
$1^* \notin \mathcal{L}_{\mathcal{G}}[<,+]$. 

Proof. The minimal monoid which can accept $1^*$ is $U_1$ and clearly the 
language is in $\mathcal{N}$. By Theorem 2, if there is a formula in 
$\mathcal{L}_{\mathcal{G}}[<,+]$ which can define $1^*$, then 
$\mathcal{L}_{\mathcal{G}}[<]$ can also define $1^*$. From Lemma 1 it follows 
that the monoid $U_1$ divides a group. But this is a contradiction [26]. □ 

Behle and Lange [7] give a notion of interpreting $\mathcal{L}_S[<,+]$ as highly 
uniform circuit classes. As a consequence we can interpret the following results 
as a separation of the corresponding circuit classes. 

Corollary 9. The following separation results hold for all $m > 1$. 

- FO[<,+] $\not\subseteq$ MOD[<,+]. 

- MOD$_m$[<,+] $\not\subseteq$ FO[<,+]. 

- FO[<,+] $\subsetneq$ FO+MOD$_m$[<,+] $\subsetneq$ FO+MOD[<,+]. 

- FO+MOD[<,+] $\subsetneq$ FO+GROUP[<,+]. 

- MAJ[<,+] $\not\subseteq$ FO+GROUP[<,+]. 



3.2 Regular languages in $\mathcal{L}_S[<,+]$ 



We now look at regular languages definable by the logic $\mathcal{L}_S[<,+]$, 
for an $S \subseteq \mathcal{M}$. We first show that this logic is closed under 
quotienting. 

Lemma 10. Let $S \subseteq \mathcal{M}$ and $\Sigma$ be a finite alphabet. Let 
$L \subseteq \Sigma^*$ be definable in $\mathcal{L}_S[<,+]$ and 
$u, v \in \Sigma^*$. Then $u^{-1}Lv^{-1}$ is also definable in 
$\mathcal{L}_S[<,+]$. 

Proof. □ 

We now show that the logic is also closed under inverse length-preserving 
morphisms. 

Lemma 11. Let $S \subseteq \mathcal{M}$. Let $\Sigma, \Gamma$ be finite alphabets 
and let $h : \Gamma^* \to \Sigma^*$ be a homomorphism such that 
$h(\Gamma) \subseteq \Sigma^r$ for some fixed $r > 0$. If $L \subseteq \Sigma^*$ 
is definable in $\mathcal{L}_S[<,+]$, then $h^{-1}(L) \subseteq \Gamma^*$ is also 
definable in $\mathcal{L}_S[<,+]$. 
Proof. □ 

We now give an algebraic characterization for regular languages definable by 
$\mathcal{L}_S[<,+]$. 

Theorem 12. Let $S \subseteq \mathcal{M}$. Let $L \subseteq \Sigma^*$ be a 
regular language. Then $L$ is definable in $\mathcal{L}_S[<,+]$ iff there exists 
a semigroup $V$ and a morphism $h : \Sigma^* \to V$ such that for all 
$k \in \mathbb{N}$, every monoid in $h(\Sigma^k)$ is also in bpc($S$). 
Proof. □ 

Let $S$ be a set of monoids such that, given a monoid $M$, it is decidable if 
$M$ divides a block product of monoids in $S$. Then, given a regular language 
$L$, it is decidable if $L \in \mathcal{L}_S[<]$. Together with our main theorem 
we get that it is decidable if $L \in \mathcal{L}_S[<,+]$. 

Corollary 13. Let $S$ be a set of monoids such that, given a monoid $M$, it is 
decidable if $M$ divides a block product of monoids in $S$. Then, given a regular 
language $L$, it is decidable if $L \in \mathcal{L}_S[<,+]$. 

For FO+MOD[<,+] this was proved in [22]. Here we prove this for the 
special case when $S$ = MOD. 

Corollary 14. Given a regular language $L$, the question whether $L$ is definable 
in MOD[<,+] is decidable. 



4 Proof of the Main Theorem 

In this section we handle the general proof steps as in Libkin or Roy and 
Straubing of removing the plus predicate from the formula in the presence of a 
neutral letter. We show that all these results go through even in the presence 
of general Lindström quantifiers. The new crucial step is Lemma 15, where we 
convert a group quantifier to an active domain formula without introducing any 
other quantifiers. The proof of this lemma is deferred to the next section. 

Let $S \subseteq \mathcal{M}$ be any nonempty set. To prove Theorem 2 we will 
consider the more general logic $\mathcal{L}_S[<,+,0,\{\equiv_q : q > 1\}]$ over 
the alphabet $\Sigma$. In this logic $+$ is a binary function, $0$ is a constant, 
and $a \equiv_q b$ means $q$ divides $b - a$. The reason for introducing these 
new relations (which are definable using $+$) is to use a quantifier elimination 
procedure. All languages recognized by this logic are in $\mathcal{L}_S[<,+]$. 

The formulas we consider will usually define languages with a neutral letter. 
Let an active domain formula over a letter $\lambda \in \Sigma$ be a formula 
where all quantifiers are of the form 
$Q_M^m x\ \neg\lambda(x)\ (\phi_1, \dots, \phi_K)$. That is, the quantifiers 
quantify only over the "active domain", the positions which do not contain the 
letter $\lambda$. For the purpose of the proof we assume that the neutral letter 
language defined by a formula $\phi \in \mathcal{L}_S[<,+]$ is a subset of 
$\Sigma^*\lambda^\omega$. The idea is to work with infinite words, where the 
arguments are easier, since the variable range is not bounded by the word 
length. 

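For $r \in \mathbb{N}$ we define the set 
$\mathcal{D}_r = \{ r^i \mid 0 < i \in \mathbb{N} \}$. We say that a formula 
$\phi(x_1, \dots, x_t) \in \mathcal{L}_S[<,+]$ collapses to $\phi'$, if $\phi'$ 
is an active domain formula in $\mathcal{L}_S[<,+]$ and there exists an 
$\mathcal{R}_\phi \in \mathbb{N}$ such that for all $r > \mathcal{R}_\phi$, 
$w \in \Sigma^*\lambda^\omega$ with $\mathrm{nnp}(w) \subseteq \mathcal{D}_r$ and 
for all $a_1, \dots, a_t \in \mathbb{N}$ we have that 

$w \models \phi(a_1, \dots, a_t) \Leftrightarrow w \models \phi'(a_1, \dots, a_t)$ 

In the above definition we say that $\mathcal{R}_\phi$ collapses $\phi$ to 
$\phi'$. 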

The results by Benedikt and Libkin [8], and Roy and Straubing [22] show 
that for all formulas $\phi \in \mathcal{L}_{\mathrm{MOD} \cup U_1}[<,+]$ there 
exists an active domain formula $\phi'$ in that logic, such that for all words 
$w \in \Sigma^*\lambda^\omega$, $w \models \phi \Leftrightarrow w \models \phi'$. 
They assume no restriction on the non-neutral positions of $w$. Observe that our 
collapse result is different from theirs. We prove that if we consider only 
words whose non-neutral positions are in $\mathcal{D}_r$, then any formula 
$\phi \in \mathcal{L}_S[<,+]$ is equivalent to the active domain formula 
$\phi' \in \mathcal{L}_S[<,+]$. That is, we are not concerned with words whose 
non-neutral positions are not in $\mathcal{D}_r$. 

We show that formulas with a group quantifier $G \in S$ can be collapsed. 

Lemma 15. Let $\phi = Q_G^m z (\phi_1, \dots, \phi_K)$ be in 
$\mathcal{L}_S[<,+]$. Assume formulas $\phi_1, \dots, \phi_K$ collapse. Then 
$\phi$ collapses to an active domain formula $\phi'$. 

The proof of Lemma 15 will be given in Section 5. Benedikt and Libkin [8] 
give a similar theorem for the monoid $U_1$ (the existential quantifier). 

Lemma 16 ([8]). Let $\phi = Q_{U_1}^m z (\phi_1, \dots, \phi_K)$ be a formula in 
$\mathcal{L}_S[<,+]$. Let us assume that formulas $\phi_1, \dots, \phi_K$ 
collapse. Then $\phi$ collapses to an active domain formula $\phi'$. 

Recall the three steps for proving the main theorem given in the Introduction. 
The following theorem proves the first step. 

Theorem 17. Let $\phi \in \mathcal{L}_S[<,+]$. Then there exists an active 
domain formula $\phi' \in \mathcal{L}_S[<,+]$ such that $\phi$ collapses to 
$\phi'$. 



Proof. Let $\phi \in \mathcal{L}_S[<,+]$. We first claim that we can convert 
$\phi$ into a formula which uses only groups and $U_1$ as quantifiers. This 
follows from the Krohn-Rhodes decomposition theorem for monoids that every 
monoid can be decomposed into block products over groups and $U_1$. This 
decomposition can then be converted back into a formula using the groups and 
$U_1$ as quantifiers [26]. 

So without loss of generality we can assume $\phi$ has only group or $U_1$ 
quantifiers. The proof is by induction on the quantifier depth. For the base 
case, let $\phi$ be a quantifier free formula. It is an active domain formula 
and therefore the claim holds. Let the claim be true for all formulas with 
quantifier depth $< d$. Lemma 15 and Lemma 16 show that the claim is true for 
formulas of type $\phi = Q_M^m z (\phi_1, \dots, \phi_K)$ with quantifier depth 
$d$, when $M$ is a group or $U_1$ respectively. We are now left with proving 
that the claim is closed under conjunction and negation. So assume that formulas 
$\phi_1, \phi_2$ collapse to $\phi_1', \phi_2'$ respectively. That is, there 
exist $\mathcal{R}_{\phi_1}, \mathcal{R}_{\phi_2} \in \mathbb{N}$ such that 
$\mathcal{R}_{\phi_1}$ collapses $\phi_1$ to $\phi_1'$ and 
$\mathcal{R}_{\phi_2}$ collapses $\phi_2$ to $\phi_2'$. Let 
$\mathcal{R} = \max\{\mathcal{R}_{\phi_1}, \mathcal{R}_{\phi_2}\}$. Then it is 
easy to see that $\mathcal{R}$ collapses $\phi_1 \wedge \phi_2$ to 
$\phi_1' \wedge \phi_2'$ and $\mathcal{R}_{\phi_1}$ collapses $\neg\phi_1$ to 
$\neg\phi_1'$. □ 

We have shown above that all formulas in $\mathcal{L}_S[<,+]$ can be collapsed 
to active domain formulas. Now using a Ramsey type argument we obtain that 
addition is useless, giving us a formula in $\mathcal{L}_S[<]$. This corresponds 
to the second step in our three step proof strategy. 

Let $R$ be any set of relations on $\mathbb{N}$ and let $\phi(x_1, \dots, x_t)$ 
be an active domain formula in $\mathcal{L}_S[R]$. We say that $\phi$ has the 
Ramsey property if for all infinite subsets $X$ of $\mathbb{N}$, there exists an 
infinite set $Y \subseteq X$ and an active domain formula 
$\psi \in \mathcal{L}_S[<]$ that satisfies the following condition. If 
$w \in \Sigma^*\lambda^\omega$ and $\mathrm{nnp}(w) \subseteq Y$, then for all 
$a_1, \dots, a_t \in Y$, 

$w \models \phi(a_1, \dots, a_t) \Leftrightarrow w \models \psi(a_1, \dots, a_t)$ 

The Ramsey property for first-order logic has been considered by Libkin [17]. 
These results can be extended to our logic. 

Theorem 18. Let $R$ be a set of relations on $\mathbb{N}$. Every active domain 
formula in $\mathcal{L}_S[R]$ satisfies the Ramsey property. 

Proof. Let $\phi \in \mathcal{L}_S[R]$ be a formula. We prove the claim by 
induction on the structure of the formula. Let $P(x_1, \dots, x_k)$ be an atomic 
formula in $\phi$. We assume without loss of generality that for all $i \neq j$, 
$x_i \neq x_j$. Now consider the infinite complete hypergraph whose vertices are 
labelled by numbers from $X$ and whose edges are $k$-tuples of vertices. Let 
$i_1, \dots, i_k$ be some permutation of the numbers from 1 to $k$. Consider the 
edge formed by the vertices $v_1 < v_2 < \cdots < v_k$. We color this edge by the 
formula $x_{i_1} < x_{i_2} < \cdots < x_{i_k}$ if $P(v_{i_1}, \dots, v_{i_k})$ is 
true. Observe that each edge can have multiple colors and therefore the total 
number of different colorings possible is finite (at most $2^{k!}$). Ramsey 
theory gives us that there exists an infinite set $Y \subseteq X$ such that the 
induced subgraph on the vertices in $Y$ is monochromatic, i.e. all the edges 
are colored using the same color. Let us assume that the edges in $Y$ are 
colored $x_1 < x_2 < \cdots < x_k$. Then for all $a_1, \dots, a_k \in Y$ 

$a_1, \dots, a_k \models P(x_1, \dots, x_k) \Leftrightarrow a_1, \dots, a_k \models x_1 < x_2 < \cdots < x_k$ 



This shows that $P(x_1, \dots, x_k)$ satisfies the Ramsey property and thus all 
atomic formulas satisfy the Ramsey property. We now show that the Ramsey 
property is preserved while taking Boolean combinations of formulas. Consider 
the formula $\phi_1(x_1, \dots, x_k) \wedge \phi_2(x_1, \dots, x_k)$. We know 
that by induction hypothesis there exists a formula $\psi_1$ and an infinite set 
$X$ such that for all $a_1, \dots, a_k \in X$, 
$w \models \phi_1(a_1, \dots, a_k) \Leftrightarrow w \models \psi_1(a_1, \dots, a_k)$. 
We can now find an infinite set $Y \subseteq X$ and a formula $\psi_2$ such that 
the Ramsey property holds for the formula $\phi_2$. Therefore for all 
$a_1, \dots, a_k \in Y$ 

$w, a_1, \dots, a_k \models \phi_1 \wedge \phi_2 \Leftrightarrow w, a_1, \dots, a_k \models \psi_1 \wedge \psi_2$ 

Similarly we can show that the Ramsey property holds for disjunctions and 
negations. We now need to show that active domain quantification also preserves 
the Ramsey property. So let $X$ be an infinite subset of $\mathbb{N}$ and let 

$\phi(x) = Q_M^m z\ \neg\lambda(z)\ (\phi_1(z, x), \dots, \phi_K(z, x))$ 

be a formula in $\mathcal{L}_S[R]$. By induction hypothesis we know that there 
exists an infinite set $Y_1 \subseteq X$ and an active domain formula 
$\psi_1 \in \mathcal{L}_S[<]$ such that for all $a \in Y_1$ the Ramsey property 
is satisfied. That is, $w \models \phi_1(a) \Leftrightarrow w \models \psi_1(a)$. 
Now for $\phi_2$, using the infinite set $Y_1$ we can find an infinite set 
$Y_2 \subseteq Y_1$ and a formula $\psi_2$ satisfying the Ramsey property. 
Continuing like this will give us a set $Y_K$ and formulas 
$\psi_1, \dots, \psi_K$ such that $\forall j \le K$ and for all 
$w \in \Sigma^*\lambda^\omega$ with $\mathrm{nnp}(w) \subseteq Y_K$, we have that 
$\forall b \in Y_K, a \in Y_K$, 
$w \models \phi_j(b, a) \Leftrightarrow w \models \psi_j(b, a)$. Hence we also 
have that $\forall j \le K$ 

$\{ b \in Y_K \mid w \models \phi_j(b, a) \} = \{ b \in Y_K \mid w \models \psi_j(b, a) \}$ 

Therefore for the formula 
$\psi = Q_M^m z\ \neg\lambda(z)\ (\psi_1, \dots, \psi_K)$, we have for all $w$ 
where $\mathrm{nnp}(w) \subseteq Y_K$ and $a_1, \dots, a_t \in Y_K$ that 

$w \models \phi(a_1, \dots, a_t) \Leftrightarrow w \models \psi(a_1, \dots, a_t)$ 

Observe that $\psi$ is an active domain formula in $\mathcal{L}_S[<]$. □ 
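
To illustrate the statement of Theorem 18 on a concrete predicate (this is an example chosen for illustration, not the construction used in the proof), take the atomic formula $x + y < z$ and restrict all values to $Y = \{3^i\}$. On this set the formula agrees with the order formula $x < z \wedge y < z$, since two elements of $Y$ below $z$ sum to at most $2z/3$. The Python sketch below checks this on a finite part of $Y$ and shows that the equivalence fails on an initial segment of the naturals. All function names are hypothetical.

```python
from itertools import product


def agrees_on_powers_of_three(count):
    """Compare x + y < z with (x < z and y < z) on the first `count` powers of 3."""
    ys = [3 ** i for i in range(1, count + 1)]
    return all(
        (x + y < z) == (x < z and y < z)
        for x, y, z in product(ys, repeat=3)
    )


def agrees_on_initial_segment(bound):
    """The same comparison on {1, ..., bound}, where addition is not 'useless'."""
    xs = range(1, bound + 1)
    return all(
        (x + y < z) == (x < z and y < z)
        for x, y, z in product(xs, repeat=3)
    )


if __name__ == "__main__":
    print(agrees_on_powers_of_three(8))   # True
    print(agrees_on_initial_segment(3))   # False: 1 + 2 < 3 fails although 1 < 3 and 2 < 3
```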

We continue with the third step of our three step proof strategy. 

Lemma 19. Every active domain sentence in $\mathcal{L}_S[<]$ defines a language 
with a neutral letter. 

Proof. Let $\phi \in \mathcal{L}_S[<]$ be an active domain formula over the 
letter $\lambda \in \Sigma$. Let $w \in \Sigma^\omega$. Let 
$w' \in \Sigma^\omega$ be obtained by inserting the letter $\lambda$ in $w$ at 
some positions. Let $n_1 < n_2 < \dots$ be the elements of $\mathrm{nnp}(w)$ and 
$m_1 < m_2 < \dots$ be the elements of $\mathrm{nnp}(w')$. Let 
$\rho : \mathrm{nnp}(w) \to \mathrm{nnp}(w')$ be the bijective map 
$\rho(n_i) = m_i$. We show that for any subformula $\psi$ of $\phi$ and any 
$t \in \mathrm{nnp}(w)^s$, we have that 
$w, t \models \psi \Leftrightarrow w', \rho(t) \models \psi$. The claim holds for 
the atomic formula $x > y$, because $n_i > n_j$ iff $\rho(n_i) > \rho(n_j)$ for 
all $i, j$. Similarly the claim also holds for all other atomic formulas 
$x < y$, $x = y$ and $a(x)$ for an $a \in \Sigma$. The claim continues to hold 
under conjunctions, negations and active domain quantifications. Hence 
$w \models \phi \Leftrightarrow w' \models \phi$. This proves that $\lambda$ is a 
neutral letter for $L(\phi)$. □ 
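
The invariance used in this proof can be observed on a small example. The Python sketch below evaluates one fixed active domain sentence over the order predicate, namely "some a occurs strictly before some b", with both quantifiers ranging over non-neutral positions only. The sentence, the neutral letter `_` and the function name are assumptions of this illustration.

```python
def holds(word, neutral="_"):
    """Active domain sentence: some a occurs strictly before some b.

    Both quantifiers range only over the non-neutral positions of `word`.
    """
    active = [i for i, c in enumerate(word) if c != neutral]
    return any(
        x < y and word[x] == "a" and word[y] == "b"
        for x in active
        for y in active
    )


if __name__ == "__main__":
    print(holds("ab"), holds("_a__b_"))  # True True
    print(holds("ba"), holds("b___a"))   # False False
```

Inserting neutral letters shifts the active positions but preserves their relative order and their letters, which is all that the sentence can inspect.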



Now we can prove our main theorem. 



Proof (of Theorem 2). Let $\phi \in \mathcal{L}_S[<,+]$ be such that $L(\phi)$ is 
a language with a neutral letter $\lambda$. By Theorem 17 there exists an active 
domain sentence $\phi' \in \mathcal{L}_S[<,+]$ over $\lambda$ and a set 
$\mathcal{D}_{\mathcal{R}}$ such that $\mathcal{R}$ collapses $\phi$ to $\phi'$. 
Theorem 18 now gives an active domain formula $\psi \in \mathcal{L}_S[<]$ and an 
infinite set $Y \subseteq \mathcal{D}_{\mathcal{R}}$. We now show that 
$L(\phi) = L(\psi)$. Let $w \in \Sigma^*\lambda^\omega$. Consider the word 
$w' \in \Sigma^*\lambda^\omega$ obtained by inserting the neutral letter 
$\lambda$ in $w$ in such a way that $\mathrm{nnp}(w') \subseteq Y$. Since 
$L(\phi)$ is a language with a neutral letter we have that 
$w \models \phi \Leftrightarrow w' \models \phi$. From Theorem 17 and Theorem 18 
we get $w' \models \phi \Leftrightarrow w' \models \phi' \Leftrightarrow w' \models \psi$. 
Finally, as shown in Lemma 19, $\psi$ defines a language with a neutral letter 
and hence $w' \models \psi \Leftrightarrow w \models \psi$. □ 
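
The step that moves the non-neutral letters of $w$ onto positions from $Y$ can be sketched in Python for a finite word. As a simplification that is an assumption of this sketch, neutral letters already present in the input are dropped first, which is harmless for a neutral letter language; `targets` stands for an increasing list of positions taken from $Y$, and the function name is hypothetical.

```python
def spread(word, targets, neutral="_"):
    """Place the non-neutral letters of `word`, in order, on the strictly
    increasing positions listed in `targets`, padding with the neutral letter."""
    letters = [c for c in word if c != neutral]
    if len(targets) < len(letters):
        raise ValueError("not enough target positions")
    out = []
    for letter, pos in zip(letters, targets):
        out.extend(neutral * (pos - len(out)))  # pad up to the target position
        out.append(letter)
    return "".join(out)


if __name__ == "__main__":
    padded = spread("ab_a", [3, 9, 27])
    print([i for i, c in enumerate(padded) if c != "_"])  # [3, 9, 27]
```

The result has the same non-neutral letters in the same order as the input, so membership in a neutral letter language is unchanged.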



5 Proof of Lemma 15 

In this section we replace a group quantifier by an active domain formula. Here 
we make use of the fact that we can a priori restrict our domain as shown in 
the previous section. 

Recall that $\phi = Q_G^m z (\phi_1, \dots, \phi_K)$ and 
$G = \{m_1, \dots, m_K, 1\}$. We know that for all $i \le K$, there exists 
$\mathcal{R}_{\phi_i}$ and a formula $\phi_i'$ such that $\mathcal{R}_{\phi_i}$ 
collapses $\phi_i$ to $\phi_i'$. Then clearly $\max_i\{\mathcal{R}_{\phi_i}\}$ 
collapses $\phi_i$ to $\phi_i'$ for all $i \le K$. So without loss of generality 
we assume the $\phi_i$'s are active domain formulas. 

Before we go into the details we give a rough overview of the proof idea. The 
group quantifier will evaluate a product $\prod_j u(j)$ where $u(j)$ is a group 
element that depends on the set of $i$ such that $w, j \models \phi_i$. So we 
start by analyzing the sets $J_i = \{ j \mid w, j \models \phi_i \}$. Since the 
formulas $\phi_i$ are active domain formulas, we will see that there exists a 
set of intervals such that inside an interval the set $J_i$ is periodic. 
Boundary points for these intervals are either points in the domain, or linear 
combinations of these. In the construction of the active domain formula for 
$\phi$ we will show how to iterate over all these boundary points in a strictly 
increasing order. An active domain quantifier can only iterate over active 
domain positions, hence we will need nested active domain quantifiers, and a way 
to "encode" the boundary points by tuples of active domain positions in a 
unique and order preserving way. Additionally we need to deal with the periodic 
positions inside the intervals, without being able to compute the length of such 
an interval, or even check if the length is zero. Here we will make use of the 
inverse elements that always exist in groups. 

We start by analyzing the intervals which occur. We will pick an $\mathcal{R}_\phi >
\max\{\mathcal{R}_{\phi_i}\}$ to collapse the formula $\phi$. During the course of the proof we will
require $\mathcal{R}_\phi$ to be greater than a few other constants, which will be specified
then. But always observe that $\mathcal{R}_\phi$ will depend only on $\phi$.

Since we consider a fixed set $S$ for the rest of the paper, we will write $\mathcal{L}[<,+]$
for the logic $\mathcal{L}_S[<,+,0,\{\equiv_q : q > 1\}]$.



5.1 Intervals and Linear Functions 



We first show that every formula $\psi$ with at least one free variable has a normal
form.

Lemma 20. Let $\psi(z) \in \mathcal{L}[<,+]$. Then there exists a formula $\hat\psi(z) \in \mathcal{L}[<,+]$
such that $\hat\psi$ is equivalent to $\psi$, where all atomic formulas in $\hat\psi$ with $z$ are of the
form $z > \rho$, $z = \rho$, $z < \rho$, $z \equiv_n \rho$, where $\rho$ is a linear function on variables other
than $z$.

Proof. Terms in our logic are expressions of the form

$$\alpha_0 + \alpha_1 x_1 + \cdots + \alpha_s x_s, \quad \text{where } \alpha_i \in \mathbb{N},$$

and atomic formulas are of the form

$$\sigma = \gamma,\quad \sigma < \gamma,\quad \sigma > \gamma,\quad \sigma \equiv_m \gamma,\quad c(\sigma)$$

where $\sigma, \gamma$ are linear functions, $c \in \Sigma$ and $m > 1$.

Now using any $M \in S$, where $m_1 \in M$ is not the neutral element, we can rewrite
$c(\sigma)$ as

$$Q_M^{m_1} x\,\big((x = \sigma) \wedge c(x),\ \text{false}, \dots, \text{false}\big)$$

Now consider the atomic formulas containing the free variable $z$ in $\psi(z)$. By
multiplying with appropriate numbers, we can rewrite these atomic formulas as
$nz = \rho$, $nz < \rho$, $nz > \rho$, $nz \equiv_m \rho$ for one particular $n$, which is the least common
multiple (lcm) of all the coefficients in $\psi$. Here $\rho$ does not contain $z$ and it
might contain subtraction. That is, $nz = \rho$ might stand for $nz + \rho_1 = \rho_2$. Now
we replace $nz$ by $z$ and conjunct the formula with $z \equiv_n 0$. □
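As an illustration of the scaling step, the following sketch takes the hypothetical atoms $2z < x$ and $3z = y$, scales both to the common coefficient $n = 6$, renames $6z$ to $z$ and adds the congruence $z \equiv_6 0$. It reads the equivalence as relating $\psi$ at $z$ with $\hat\psi$ at $nz$, which is one reading of the substitution and an assumption of this example.

```python
from math import gcd


def lcm(a, b):
    return a * b // gcd(a, b)


def psi(z, x, y):
    # Original (hypothetical) atoms: 2z < x and 3z = y.
    return 2 * z < x and 3 * z == y


N = lcm(2, 3)  # common coefficient for z after scaling


def psi_hat(z, x, y):
    # Scaled atoms 6z < 3x and 6z = 2y with 6z renamed to z,
    # conjoined with the congruence z = 0 (mod 6).
    return z < 3 * x and z == 2 * y and z % N == 0


def agree(bound):
    """Brute-force check that psi(z) and psi_hat(N*z) coincide below `bound`."""
    return all(
        psi(z, x, y) == psi_hat(N * z, x, y)
        for z in range(bound)
        for x in range(3 * bound)
        for y in range(3 * bound)
    )


print(agree(8))
```

The congruence conjunct is what rules out values of the renamed variable that are not multiples of $n$.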

For any formula $\psi(z)$, the notation $\hat\psi(z)$ denotes the normal form as in
Lemma 20. Let $x_1, \dots, x_s$ be the bound variables occurring in $\hat\phi_i(z)$ and
$y_1, \dots, y_r$ be the free variables other than $z$ in $\hat\phi_i(z)$. Hence the terms $\rho$ that
appear in the formula $\hat\phi_i(z)$ can be identified as functions $\rho : \mathbb{N}^{s+r} \to \mathbb{N}$.

We collect all functions $\rho(x, y)$ that occur in the formulas $\hat\phi_i(z)$ for an $i \le K$:

$$R = \{\rho \mid \rho \text{ is a linear term occurring in } \hat\phi_i(z),\ i \le K\}$$

We define the set $T$ of offsets as a set of terms which are functions using the
variables $y_1, \dots, y_r$ as parameters:

$$T = \{\rho(0, \dots, 0, y_1, \dots, y_r) \mid \rho \in R\} \cup \{0\}$$

Consider the set of absolute values of all the coefficients appearing in one of
the functions in $R$. Let $\alpha' \in \mathbb{N}$ be the maximum value among these. That is,
$\alpha' = \max\{|\gamma| \mid f \in R,\ \gamma \text{ is a coefficient in } f\}$. Let $\Delta = s \cdot \alpha'$. Now we can define
our set of extended functions. For a $t \in T$ we define a set of terms which are
functions using the variables $x_1, \dots, x_s, y_1, \dots, y_r$ as parameters:

$$F_t = \Big\{ \sum_{i=1}^{s'} \alpha_i x_i + t \;\Big|\; s' \le s,\ -\Delta \le \alpha_i \le \Delta,\ \alpha_i \in \mathbb{Z} \Big\}.$$

We denote by $F = \bigcup_{t \in T} F_t$.

For a fixed word $w \in \Sigma^*\lambda^\omega$ and a fixed assignment of the free variables $y$ to
$a$ we let

$$B^{w,a} = \{ f(d, a) \mid t \in T,\ f \in F_t,\ d \in \mathrm{nnp}(w)^{s'},\ d_1 > d_2 > \cdots > d_{s'} \}$$

be the set of boundary points. Note that the assignments to the functions are of
strictly decreasing order. Let $b_1 < b_2 < \cdots < b_l$ be the boundary points in $B^{w,a}$.

Then the following sets are called intervals: $(-1, b_1), (b_1, b_2), \dots, (b_{l-1}, b_l), (b_l, \infty)$.

Here $(a, b) = \{x \in \mathbb{N} \mid a < x < b\}$. We also split the set of points in $B^{w,a}$
depending on the offset:

$$B_t^{w,a} = \{ f(d, a) \mid f \in F_t,\ d \in \mathrm{nnp}(w)^{s'},\ d_1 > d_2 > \cdots > d_{s'} \}.$$
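The definitions of boundary points and intervals can be enumerated by brute force on a small instance. The sketch below assumes that the offsets are already evaluated under the assignment $a$ (so they are plain integers), passes the coefficient bound directly as `delta`, and keeps only boundary points in $\mathbb{N}$; the function names are hypothetical.

```python
from itertools import combinations, product


def boundary_points(offsets, nnp, delta, s):
    """All values t + sum(alpha_i * d_i) with at most s strictly decreasing
    positions d_1 > d_2 > ... from nnp and coefficients in [-delta, delta]."""
    points = set()
    decreasing = sorted(nnp, reverse=True)
    for t in offsets:
        for k in range(s + 1):
            # combinations of a decreasing list are strictly decreasing tuples
            for d in combinations(decreasing, k):
                for coeffs in product(range(-delta, delta + 1), repeat=k):
                    points.add(t + sum(a * x for a, x in zip(coeffs, d)))
    # Assumption of this sketch: only points that are positions in N are kept.
    return sorted(p for p in points if p >= 0)


def intervals(points):
    """Open intervals (-1, b_1), (b_1, b_2), ..., (b_l, infinity)."""
    bounds = [-1] + list(points) + [float("inf")]
    return list(zip(bounds, bounds[1:]))


points = boundary_points(offsets=[0], nnp={5, 25}, delta=1, s=2)
print(points)             # [0, 5, 20, 25, 30]
print(intervals(points))
```

Note that every position of `nnp` is itself a boundary point, so each interval contains only neutral-letter positions.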

In the following lemma we fix a word $w \in \Sigma^*\lambda^\omega$ and an $a \in \mathbb{N}^r$.

Lemma 21. $\{\rho(d_1, \dots, d_s, a) \mid \rho \in R,\ d_i \in \mathrm{nnp}(w)\} \cup \mathrm{nnp}(w) \subseteq B^{w,a}$

Proof. Let $S = \{\rho(d_1, \dots, d_s, a) \mid \rho \in R,\ d_i \in \mathrm{nnp}(w)\} \cup \mathrm{nnp}(w)$. Since $\rho(x_1) =
x_1$ is in $F_t$ for some $t \in T$, we have $\mathrm{nnp}(w) \subseteq B^{w,a}$. Let $b \in S$. Then there is
a function $\rho = \sum_{i=1}^{s'} \alpha_i x_i + t$ in $F_t$ and values $p_1, \dots, p_{s'} \in \mathrm{nnp}(w)$ such that
$b = \rho(p_1, \dots, p_{s'}, a)$. Let $p'_1 > p'_2 > \cdots > p'_l$ be the ordered set of all $p_i$'s in the
above assignment. We let $\rho'(x_1, \dots, x_l) = \sum_i \beta_i x_i + t$, where $\beta_i = \sum_{j : p_j = p'_i} \alpha_j$.
Therefore $b = \rho'(p'_1, \dots, p'_l)$. Since $|\beta_i| \le \alpha' \cdot s = \Delta$ we have $\rho' \in F_t$ and hence
$b \in B_t^{w,a}$. □

Let $q$ be the lcm of all $q'$ where $\equiv_{q'}$ occurs in one of the $\phi_i$. We need the
following lemma, that inside an interval with only neutral letters, the congruence
relations decide the truth of an active domain formula.

Lemma 22. Let $a_1, \dots, a_r \in \mathbb{N}$ and let $c, d \in \mathbb{N}$ belong to the same interval in
$B^{w,a}$ such that $c \equiv_q d$. Then for all $i \le K$: $w, c \models \phi_i(z, a) \Leftrightarrow w, d \models \phi_i(z, a)$.

Proof. The proof is by induction on the structure of the formula $\phi_i$. We will
show that for all $b_i \in \mathrm{nnp}(w)$ and all subformulas $\psi(z, x, y)$ of $\phi_i$ we have
$w, c, b, a \models \psi \Leftrightarrow w, d, b, a \models \psi$. The atomic formulas of $\phi_i(z, a)$ are of the following form:
$z < \rho(x, a)$, $z = \rho(x, a)$, $z > \rho(x, a)$, $z \equiv_{q'} \rho(x, a)$, $a(z)$ and formulas which do
not depend on $z$. It is clear that the truth of formulas which do not depend
on $z$, of $a(z)$ and of $z \equiv_{q'} \rho$ does not change whether we assign $c$ or $d$ to $z$. Let
$b \in \mathrm{nnp}(w)^s$. By Lemma 21 we know that $\rho(b, a)$ is in $B^{w,a}$ and since $c, d$ lie in
the same interval it follows that $c < \rho(b, a) \Leftrightarrow d < \rho(b, a)$. Similarly we can show
that the truth of $z > \rho$, $z = \rho$ does not change on $z$ being assigned $c$ or $d$. Thus
the claim holds for atomic formulas. The claim clearly holds for
conjunction and negation of formulas. Now let the claim hold for subformulas
$\psi_1, \dots, \psi_K$. Therefore for all $i \le K$ we have that $\{b \in \mathrm{nnp}(w)^s \mid w, c, b, a \models \psi_i\} =
\{b \in \mathrm{nnp}(w)^s \mid w, d, b, a \models \psi_i\}$. Therefore we have that

$$w, c, b_2, \dots, b_s, a \models Q_G^{m} x\,\neg\lambda(x)\,(\psi_1, \dots, \psi_K)$$
$$\Leftrightarrow\ w, d, b_2, \dots, b_s, a \models Q_G^{m} x\,\neg\lambda(x)\,(\psi_1, \dots, \psi_K)$$

Hence the claim is closed under active domain quantification. □



The following lemma deals with the infinite interval.

Lemma 23. Let $b$ belong to the infinite interval and $a \in \mathbb{N}^r$. If $w, a \models \phi$ then
$w, b, a \not\models \phi_i$ for any $i \le K$.

Proof. Let $i \le K$ and $b$ be in the infinite interval with $w, b, a \models \phi_i$. From Lemma
22 we know that all points $c \equiv_q b$ such that $c$ is also in the infinite interval
will be witnesses for $\phi_i$. This means the set of witnesses is infinite and hence
$w, a \not\models \phi$. □

Lemma 22 says that inside an interval, the congruence relations decide the
satisfiability of the formulas $\phi_i$. This shows that it is enough to know the truth
values of the $\phi_i$ at a distance of at most $q$ from the boundary points, since the truth values
inside an interval are going to repeat after every $q$ positions. The rest of the proof
demonstrates

1. How we can treat each $B_t$ differently.

2. There is an active domain formula which goes through the points in $B_t$ in
an increasing order.

We fix the word $w \in \Sigma^*\lambda^\omega$ and the assignment $a$. Therefore we drop the superscripts
in $B^{w,a}$ ($B_t^{w,a}$) and call them $B$ ($B_t$).



5.2 Treating each Bt differently 

Let $p = q|G|$, where $q$ was defined in the previous section and depends on the
$\equiv_{q'}$ predicates. For an element $g \in G$, we have $g^{|G|} = 1_G$, so $g^x = g^{x+|G|}$. Recall
the definitions of $T$, $B$ from Section 5.1.

Recall from the Preliminaries (Section 2) that we denoted by $u(i)$ the group
element at position $i$. That is, $u(i) = m_j$ iff $w, i, a \models \phi_j \wedge \bigwedge_{l<j} \neg\phi_l$. Our aim
is to give an active domain formula such that the formula evaluates to true iff
the group element $\prod_{i=0}^{\infty} u(i)$ is equal to $m$. The rest of this subsection will be
devoted to computing this product in a way which helps in building an active
domain formula.

Let $b < b'$ be boundary points in $B$. Below we compute $\prod_{i=b+1}^{b'} u(i)$ in a
different way:

$$\prod_{i=b+1}^{b'} u(i) = \Big(\prod_{i>b} u(i)\Big)\Big(\prod_{i>b'} u(i)\Big)^{-1}$$

Observe that we can compute the product of the interval using two terms that
both need to know only one boundary of the interval. It becomes simpler if we
note that the two products do not really need to multiply all the elements $u(i)$
for $i > b'$, but simply need to agree on a common set of elements to multiply.
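The identity also holds in a non-commutative group, because the common tail cancels on the right. A minimal sketch, assuming the group $S_3$ represented as permutation tuples and a finite sequence standing in for the almost-everywhere-identity sequence $u$; all names are illustrative.

```python
from functools import reduce
from itertools import permutations

IDENT = (0, 1, 2)
S3 = list(permutations(range(3)))  # the six elements of S_3


def mul(a, b):
    """Group product: apply permutation a, then permutation b."""
    return tuple(b[a[i]] for i in range(3))


def inv(a):
    """Inverse permutation."""
    result = [0] * 3
    for i, x in enumerate(a):
        result[x] = i
    return tuple(result)


def prod(elems):
    """Left-to-right product of a sequence of group elements."""
    return reduce(mul, elems, IDENT)


def tail(u, b):
    """Product of u(i) for all i > b."""
    return prod(u[b + 1:])


def interval_product(u, b, b2):
    """Product of u(b+1) ... u(b2), computed from the two tails only."""
    return mul(tail(u, b), inv(tail(u, b2)))


u = [S3[i % 6] for i in (3, 1, 4, 1, 5, 2, 0, 3, 5)]
print(interval_product(u, 1, 5) == prod(u[2:6]))
```

Each tail depends on a single boundary point, which is the property the construction exploits.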

For a $b \in B$, we define the function $\mathrm{IL}(b)$ to be the length of the interval to
the left of $b$. That is, if $(b', b)$ form an interval then $\mathrm{IL}(b) = b - b' - 1$. Similarly we
define $\mathrm{IR}(b)$ to be the length of the interval to the right of $b$. For all $k \le |T|$, we
define functions $N_k(b)$ and $\hat N_k(b)$, which map points $b \in B$ to a group element.

$$N_0(b) = \begin{cases}
u(b+1)\,u(b+2) \cdots u(b+\mathrm{IR}(b)) & \text{if } \mathrm{IR}(b) < p \\
u(b+1)\,u(b+2) \cdots u(b+r) & \text{if } \mathrm{IR}(b) \ge p \text{ and } r < p,\ b + r \equiv_p 0
\end{cases}$$

$$\hat N_0(b) = \begin{cases}
1_G & \text{if } \mathrm{IL}(b) < p \\
u(b-p) \cdots u(b-p+r) & \text{if } \mathrm{IL}(b) \ge p \text{ and } r < p,\ b + r \equiv_p 0
\end{cases}$$

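Inductively we define

$$N_k(b) = N_{k-1}(b) \prod_{\substack{b' \in B_{t_k} \\ b' > b}} \big(\hat N_{k-1}(b')\big)^{-1} u(b')\, N_{k-1}(b'),$$

$$\hat N_k(b) = \hat N_{k-1}(b) \prod_{\substack{b' \in B_{t_k} \\ b' > b}} \big(\hat N_{k-1}(b')\big)^{-1} u(b')\, N_{k-1}(b').$$

We first show how $N_k(b)$ and $\hat N_k(b')$ are related for $b, b' \in B$.

Lemma 24. Let $0 \le k \le |T|$. Let $b < b' \in B$ such that there are no points
$b'' \in \bigcup_{i>k} B_{t_i}$ with $b < b'' < b'$. Then $N_k(b)\big(\hat N_k(b')\big)^{-1} = \prod_{i=b+1}^{b'-1} u(i)$.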

Proof. We prove this by induction over $k$. Let $k = 0$ and let $(b, b')$ form an
interval in $B$. If $b' - b < p$ then

$$N_0(b)\big(\hat N_0(b')\big)^{-1} = \big(u(b+1)\,u(b+2) \cdots u(b+\mathrm{IR}(b))\big)(1_G)^{-1} = \prod_{i=b+1}^{b'-1} u(i)$$

If the interval is large, i.e. $b' - b \ge p$, then let $s, t \in \mathbb{N}$ be the smallest,
resp. the largest, numbers such that $b < s \le t < b'$ and $s \equiv_p t \equiv_p 0$. Lemma
22 shows that inside an interval all positions congruent modulo $q$ satisfy the
same formulas. Therefore $u(b'-p)\,u(b'-p+1) \cdots u(b'-1) = 1_G$, and hence
$\big(u(b'-p)\,u(b'-p+1) \cdots u(t)\big)^{-1} = u(t+1) \cdots u(b'-1)$. So

$$N_0(b)\big(\hat N_0(b')\big)^{-1} = \big(u(b+1)\,u(b+2) \cdots u(s)\big)\big(u(t+1) \cdots u(b'-1)\big) = \prod_{i=b+1}^{b'-1} u(i)$$

The last equality is true since $u(s+1) \cdots u(t) = 1_G$.
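The base case can be checked by brute force. The sketch below assumes $G = S_3$ (so $|G| = 6$), a sequence $u$ that is $q$-periodic inside the interval $(b, b')$, and the case split $\mathrm{IR}(b) < p$ versus $\mathrm{IR}(b) \ge p$; the helper names are hypothetical. It relies on the fact that $p = q|G|$ consecutive elements of a $q$-periodic sequence multiply to $1_G$, since they form the $|G|$-th power of a single block of length $q$.

```python
from functools import reduce
from itertools import permutations, product

IDENT = (0, 1, 2)
S3 = list(permutations(range(3)))  # the six elements of S_3


def mul(a, b):
    """Group product: apply permutation a, then permutation b."""
    return tuple(b[a[i]] for i in range(3))


def inv(a):
    result = [0] * 3
    for i, x in enumerate(a):
        result[x] = i
    return tuple(result)


def prod(elems):
    return reduce(mul, elems, IDENT)


def base_case_holds(b, b2, pattern):
    """Check N_0(b) * inverse(N0_hat(b2)) == u(b+1) ... u(b2-1) for one interval."""
    q = len(pattern)
    p = q * len(S3)

    def u(i):
        return pattern[i % q]  # q-periodic inside the interval

    length = b2 - b - 1  # IR(b), which equals IL(b2)
    if length < p:
        n0 = prod(u(i) for i in range(b + 1, b2))
        n0_hat = IDENT
    else:
        r = (-b) % p  # b + r is a multiple of p, with r < p
        n0 = prod(u(i) for i in range(b + 1, b + r + 1))
        r2 = (-b2) % p  # b2 + r2 is a multiple of p, with r2 < p
        n0_hat = prod(u(i) for i in range(b2 - p, b2 - p + r2 + 1))
    return mul(n0, inv(n0_hat)) == prod(u(i) for i in range(b + 1, b2))


print(all(
    base_case_holds(b, b2, pattern)
    for pattern in product(S3, repeat=2)
    for b in range(15)
    for b2 in range(b + 1, b + 40)
))
```

Neither $N_0(b)$ nor $\hat N_0(b')$ needs the length of the interval once it is known to be at least $p$, which is the point of the construction.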

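As induction hypothesis assume that the lemma is true for all $k' < k$. Since
for all $b'' > b'$ the terms $\big(\hat N_{k-1}(b'')\big)^{-1} u(b'')\, N_{k-1}(b'')$ appear in both $N_k(b)$ and
$\hat N_k(b')$, they cancel out (whatever they compute to). Thus

$$N_k(b)\big(\hat N_k(b')\big)^{-1} = \Big(N_{k-1}(b) \prod_{b < b'' < b'} \big(\hat N_{k-1}(b'')\big)^{-1} u(b'')\, N_{k-1}(b'')\Big)\big(\hat N_{k-1}(b')\big)^{-1}$$

Let $b = b_0 < b_1 < \cdots < b_{x-1} < b_x = b'$ be all positions in $B_{t_k}$ between $b$ and
$b'$. By the requirements of the lemma the only positions of $B$ between $b_i$ and
$b_{i+1}$ are in $\bigcup_{j<k} B_{t_j}$. Writing out the product we get

$$N_k(b)\big(\hat N_k(b')\big)^{-1} = N_{k-1}(b_0)\big(\hat N_{k-1}(b_1)\big)^{-1} u(b_1)\, N_{k-1}(b_1) \cdots \big(\hat N_{k-1}(b_{x-1})\big)^{-1} u(b_{x-1})\, N_{k-1}(b_{x-1})\big(\hat N_{k-1}(b_x)\big)^{-1}$$

By I.H. $N_{k-1}(b_i)\big(\hat N_{k-1}(b_{i+1})\big)^{-1} = \prod_{j=b_i+1}^{b_{i+1}-1} u(j)$. Hence $N_k(b)\big(\hat N_k(b')\big)^{-1} = \prod_{i=b+1}^{b'-1} u(i)$. □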



The following lemma shows that $u(0)N_{|T|}(0)$ gives the product of the group
elements.

Lemma 25. We have that $u(0)N_{|T|}(0) = \prod_{i=0}^{\infty} u(i)$.

Proof. Using an appropriate induction hypothesis we get that $N_{|T|}(0) = \prod_{i=1}^{l} u(i)$,
where $l > \max(B)$. The lemma now follows from Lemma 23, which gives that
$u(i) = 1_G$ for every $i$ in the infinite interval. □

We now give active domain formulas $\gamma^m$, $m \in G$, such that $\gamma^m$ is true
iff $N_{|T|}(0) = m$. For this we make use of the inductive definition of $N_k$ and
show that there exist active domain formulas $\gamma_k^m$ such that $w \models \gamma_k^m(b) \Leftrightarrow
N_k(b) = m$. Similarly we give active domain formulas $\hat\gamma_k^m$ such that $w \models
\hat\gamma_k^m(b) \Leftrightarrow \hat N_k(b) = m$. Observe that $N_k(b)$ is obtained by computing the product
of $\big(\hat N_{k-1}(b')\big)^{-1} u(b')\, N_{k-1}(b')$ over $b'$, where $b'$ strictly increases. This requires
us to traverse the elements in $B_{t_{k-1}}$ in an increasing order. The following section
builds a sorting tree to sort the elements of $B_{t_{k-1}}$ in an increasing order.

5.3 Sorting Tree 

Let $t \in T$. The aim of this section is to create a data structure which can
traverse the elements in $B_t$ in ascending order.

For a $t \in T$, we define a tree called the sorting tree, $\mathcal{T}_t$, which corresponds to $B_t$.
The tree satisfies the following property. If the leaves of the tree are enumerated
from left to right, then we get the set $B_t$ in ascending order. A node in $\mathcal{T}_t$ is
labeled by a tuple $(f, A)$, where $f(x_1, \dots, x_l)$ is a function in $F_t$ and $A$ is an assignment
for the variables in $f$ such that $A(x_1) > A(x_2) > \cdots > A(x_l)$ and $\forall i \le l : A(x_i) \in
\mathrm{nnp}(w)$.

We show how to inductively build the tree. The root is labeled by the tuple
$(t, \{\})$, where $t$ is the function which depends only on $y$ (and hence is constant on
$x$) and $\{\}$ is the empty assignment. The root is not marked as a leaf node.

Consider the internal node $(f(x_1, \dots, x_l), A)$. It will have three kinds of children,
ordered from left to right as follows.

1. Left children: These are labeled by tuples of the form $(f'_\alpha, A'_j)$ where
$f'_\alpha(x_1, \dots, x_{l+1}) = f(x_1, \dots, x_l) + \alpha x_{l+1}$ and $-\Delta \le \alpha < 0$, $-\alpha \in \mathbb{N}$,
$A'_j = A \cup [x_{l+1} \mapsto j]$, where $j < A(x_l)$ and $j \in \mathrm{nnp}(w)$.

The tuple $(f'_{\alpha_1}, A'_{j_1})$ is on the left of $(f'_{\alpha_2}, A'_{j_2})$ if $j_1 > j_2$, or if $j_1 = j_2$ and
$\alpha_1 < \alpha_2$.

2. Middle child: It is labeled by the tuple $(f'', A)$ where $f''(x_1, \dots, x_l) =
f(x_1, \dots, x_l)$. It is marked as a leaf node.

3. Right children: These are labeled by tuples of the form $(f'_\alpha, A'_j)$ where
$f'_\alpha(x_1, \dots, x_{l+1}) = f(x_1, \dots, x_l) + \alpha x_{l+1}$ and $0 < \alpha \le \Delta$, $\alpha \in \mathbb{N}$, $A'_j =
A \cup [x_{l+1} \mapsto j]$, where $j < A(x_l)$ and $j \in \mathrm{nnp}(w)$.

The tuple $(f'_{\alpha_1}, A'_{j_1})$ is on the left of $(f'_{\alpha_2}, A'_{j_2})$ if $j_1 < j_2$, or if $j_1 = j_2$ and
$\alpha_1 < \alpha_2$.

Observe that if there is no $j$ such that $j < A(x_l)$ and $j \in \mathrm{nnp}(w)$, then $(f, A)$
will only have the child $(f'', A)$.

Note that in our tree construction the values of the children of a node increase
from left to right. The tree is built until all functions with $s$ variables appear in
leaves and hence the depth of the tree is $s + 2$. Figure 1 shows part of a tree,
where $\Delta = 2$, $t = 0$, $\mathcal{R} = 5$ and $\mathrm{nnp}(w) = \{5, 25, 625\} \subseteq \mathcal{D}_{\mathcal{R}}$.
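The tree construction can be simulated directly. The following sketch assumes that the non-neutral positions are powers of $\mathcal{R}$, takes the illustrative values $\mathcal{R} = 13$, $\Delta = 2$, $s = 2$ and $t = 0$, and uses hypothetical function names; values that happen to be negative are kept, since only the order matters here. It collects the leaves from left to right and compares them with a brute-force enumeration of $B_t$.

```python
from itertools import combinations, product


def sorting_tree_leaves(t, nnp, delta, s):
    """Leaf values of the sorting tree for offset t, read from left to right."""
    positions = sorted(nnp)
    leaves = []

    def visit(value, last, depth):
        # value = f(A); last = A(x_l), or None at the root; depth = l
        smaller = [j for j in positions if last is None or j < last]
        if depth < s:
            for j in reversed(smaller):            # left children: j decreasing
                for alpha in range(-delta, 0):     # alpha = -delta, ..., -1
                    visit(value + alpha * j, j, depth + 1)
        leaves.append(value)                       # middle child: the leaf (f'', A)
        if depth < s:
            for j in smaller:                      # right children: j increasing
                for alpha in range(1, delta + 1):  # alpha = 1, ..., delta
                    visit(value + alpha * j, j, depth + 1)

    visit(t, None, 0)
    return leaves


def brute_force_points(t, nnp, delta, s):
    """The set B_t: all values of functions in F_t on strictly decreasing tuples."""
    points = set()
    decreasing = sorted(nnp, reverse=True)
    for k in range(s + 1):
        for d in combinations(decreasing, k):
            for coeffs in product(range(-delta, delta + 1), repeat=k):
                points.add(t + sum(a * x for a, x in zip(coeffs, d)))
    return points


nnp_w = {13, 13**2, 13**3}  # hypothetical positions, powers of R = 13 > 3*s*delta
leaves = sorting_tree_leaves(0, nnp_w, delta=2, s=2)
print(leaves == sorted(brute_force_points(0, nnp_w, delta=2, s=2)))
```

With smaller gaps between the positions the subtrees of neighboring nodes can overlap, which is why a lower bound on $\mathcal{R}$ is required below.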




Fig. 1. Sorting Tree: The double circles represent leaves of the tree. The nodes of the
tree are labelled $(f, A)$, where $A$ is an assignment for the function $f$ and $t = 0$. For
better presentation we only show the assignment to the newly introduced variable in a
node. For example, the tuple $(x - 2y, 25)$ assigns $x = 625$ and $y = 25$. The assignment
to $x$ is given in the node's parent.

The following lemma holds if $\mathcal{R} > 3s\Delta$. We also assume that $\mathrm{nnp}(w) \subseteq \mathcal{D}_{\mathcal{R}}$.
Given a node $(f, A)$, we say the value of the node is the function $f$ evaluated
under the assignment $A$ (denoted by $f(A)$).

Lemma 26. Let $N$ be an internal node labeled by a function $f(x_1, \dots, x_l)$ with
$l < s$ and an assignment $A$. If $A(x_l) = \mathcal{R}^c$ for some $c \ge 1$, then the children of
this node have values in the range $[f(A) - \Delta\mathcal{R}^{c-1},\ f(A) + \Delta\mathcal{R}^{c-1}]$. Moreover the
values of the children increase from left to right.



Proof. By construction. □ 

Next we show that for any two neighboring nodes in the tree, the values in
the leaves of the subtree rooted at the left node are less than the values in the
leaves of the subtree rooted at the right node. Let $V_{(f,A)}$ denote the set of values
in the leaves of the subtree rooted at $(f, A)$.

Lemma 27. Let $(f, A)$ and $(f', A')$ be neighboring nodes of the same parent
such that $(f, A)$ is to the left of $(f', A')$. Then $u < v$ for every $u \in V_{(f,A)}$ and
$v \in V_{(f',A')}$.

Proof. Let $f = \sum_{i=1}^{l-1} \alpha_i x_i + \alpha_l x_l + t$ and $f' = \sum_{i=1}^{l-1} \alpha_i x_i + \alpha'_l x_l + t$. We show that
the rightmost element $u$ in $V_{(f,A)}$ is less than the leftmost element $v$ in $V_{(f',A')}$.
From Lemma 26 and applying induction on the depth of the tree, one can show
that $u \le f(A) + (s - l)\Delta\mathcal{R}^{c-1}$ and $v \ge f'(A') - (s - l)\Delta\mathcal{R}^{c'-1}$. Here $\mathcal{R}^c, \mathcal{R}^{c'}$
are the minimum assignments in $A$ and $A'$ respectively. Let us assume that both
coefficients $\alpha'_l, \alpha_l > 0$. A similar analysis can be given for other combinations of
$\alpha'_l$ and $\alpha_l$. Now since $(f, A)$ is the left neighbor of $(f', A')$ we have $\mathcal{R}^c \le \mathcal{R}^{c'}$.
Then $v - u \ge \alpha'_l\mathcal{R}^{c'} - (s-l)\Delta\mathcal{R}^{c'-1} - \alpha_l\mathcal{R}^{c} - (s-l)\Delta\mathcal{R}^{c-1} \ge \mathcal{R}^{c'} - 3s\Delta\mathcal{R}^{c'-1} > 0$.
The claim follows, since $\mathcal{R} > 3s\Delta$. □

The next lemma says that the values of the leaves of the tree increase as we
traverse from left to right.

Lemma 28. Let $(f, A)$ and $(f', A')$ be two distinct nodes such that $f(A) <
f'(A')$. Then $(f, A)$ appears to the left of $(f', A')$.

Proof. This follows from Lemma 27. □

Lemma 29 (Tree Lemma). Fix $t \in T$. Assume that for every $b \in B_t$ we have
an element $g_b \in G$. For all $f \in F_t$, $m \in G$ let $\gamma_f^m(x)$ be active domain formulas
such that $w, d \models \gamma_f^m(x)$ iff $g_{f(d)} = m$. Then there are active domain formulas
$\Gamma^{m'}$ such that $w \models \Gamma^{m'}$ iff $\prod_{b \in B_t} g_b = m'$.

Proof. We will use the sorting tree $\mathcal{T}_t$ corresponding to $B_t$ for the construction
of our formula. Recall that the nodes are labeled by tuples $(f, A)$, where $f$ is a
function and $A$ is the assignment of the parameters of $f$. Let $V_{(f,A)} \subseteq B_t$ be the
set of values at the leaves of the subtree rooted at the node labeled by $(f, A)$,
and $g_{(f,A)} = \prod_{b \in V_{(f,A)}} g_b$. We will do induction on the depth $D$ of the tree. Let
$\Gamma_f^{m,D}(x)$ be a formula such that $w, d \models \Gamma_f^{m,D}(x)$ iff $\prod_{b \in V_{(f,d)}} g_b = m$, where
$(f, d)$ is the label of a node that has a subtree of depth at most $D$. Hence we
multiply all group elements $g_b$ for which $b$ is in $V_{(f,d)}$.

Base Case (leaves): We define $\Gamma_f^{m,0}(x) = \gamma_f^m(x)$.



Induction Step: Let us assume that the claim is true for all nodes with a
subtree of depth at most $D$. Let the node labeled by $(f, A)$ have a subtree of
depth $D + 1$. We will need to specify the formula $\Gamma_f^{m,D+1}(x)$, where $x$ agrees with
the assignment $A$. For every child $(f', A')$ of $(f, A)$ the depth of the corresponding
subtree is less than or equal to $D$. Hence by induction we already have formulas
for the children.

Recall what the children of $(f, A)$ are: they are of the form $(f'_\alpha, A'_j)$ and $(f'', A)$.
Moreover all nodes $(f'_\alpha, A'_j)$, where $\alpha$ is negative, come to the left of $(f'', A)$
and all nodes $(f'_\alpha, A'_j)$, where $\alpha$ is positive, come to its right.

We start by grouping some of the children and computing their product.
We let $\Gamma^-(A'_j)$ be the product of all subtrees labeled by $(f'_\alpha, A'_j)$ for $\alpha =
-\Delta, -\Delta+1, \dots, -1$. This is a finite product so we can compute this by a Boolean
combination of the formulas $\Gamma_{f'_\alpha}^{m,D}(x, x_{l+1})$.

$$\pi_f^{-,m,D}(x, x_{l+1}) ::= \bigvee_{m_{-\Delta} \cdots m_{-1} = m}\ \bigwedge_{\alpha=-\Delta}^{-1} \Gamma_{f'_\alpha}^{m_\alpha,D}(x, x_{l+1})$$

Now we want to compute the product $\big(\prod_{j \in \mathrm{nnp}(w)} (\Gamma^-(A'_j))^{-1}\big)^{-1}$, which is the
product of the $\Gamma^-(A'_j)$ where $j \in \mathrm{nnp}(w)$ is decreasing. But this can be computed
using an active domain group quantifier, $\Gamma_f^{-,m,D}(x)$, as follows:

$$\Gamma_f^{-,m,D}(x) ::= Q_G^{m^{-1}} x_{l+1}\,\big(\neg\lambda(x_{l+1}) \wedge (x_l > x_{l+1})\big)\ \big(\pi_f^{-,m_1^{-1},D}(x, x_{l+1}), \dots, \pi_f^{-,m_K^{-1},D}(x, x_{l+1})\big)$$

Recall that the elements of the group $G$ are ordered $m_1, \dots, m_K$. For the single node
$(f'', A)$ we already have the formulas $\Gamma_{f''}^{m,D}(x)$ by induction (here we have $x$ since
the assignment $A$ is the same for $(f, A)$ and $(f'', A)$).

Similarly we define formulas $\pi_f^{+,m,D}(x, x_{l+1})$ for the positive coefficients, and
compute their product $\prod_{j \in \mathrm{nnp}(w)} \Gamma^+(A'_j)$ in an increasing order.

$$\pi_f^{+,m,D}(x, x_{l+1}) ::= \bigvee_{m_1 \cdots m_\Delta = m}\ \bigwedge_{\alpha=1}^{\Delta} \Gamma_{f'_\alpha}^{m_\alpha,D}(x, x_{l+1})$$

$$\Gamma_f^{+,m,D}(x) ::= Q_G^{m} x_{l+1}\,\big(\neg\lambda(x_{l+1}) \wedge (x_l > x_{l+1})\big)\ \big(\pi_f^{+,m_1,D}(x, x_{l+1}), \dots, \pi_f^{+,m_K,D}(x, x_{l+1})\big)$$

We have now computed the product of the group elements for the three different
groups of children. So by a Boolean combination over these formulas we get
$\Gamma_f^{m,D+1}(x)$:

$$\bigvee_{m' m'' m''' = m} \big(\Gamma_f^{-,m',D}(x) \wedge \Gamma_{f''}^{m'',D}(x) \wedge \Gamma_f^{+,m''',D}(x)\big)$$

So finally we get $\Gamma^{m'}$, which is the formula obtained at the
root of the tree. □
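A quantifier that scans positions in increasing order multiplies its arguments in that order, whereas the left children have to be multiplied for decreasing $j$. One way to reverse the order of a product in a group, assumed here as the reading of the inverses in the construction above, is to multiply the inverses in increasing order and invert the result. A minimal sketch over $S_3$ with illustrative names:

```python
from functools import reduce
from itertools import permutations, product

IDENT = (0, 1, 2)
S3 = list(permutations(range(3)))  # the six elements of S_3


def mul(a, b):
    """Group product: apply permutation a, then permutation b."""
    return tuple(b[a[i]] for i in range(3))


def inv(a):
    result = [0] * 3
    for i, x in enumerate(a):
        result[x] = i
    return tuple(result)


def prod(elems):
    return reduce(mul, elems, IDENT)


def reversed_product_via_inverses(elems):
    """Compute x_n ... x_1 while only ever scanning x_1, ..., x_n in order."""
    return inv(prod(inv(x) for x in elems))


print(all(
    reversed_product_via_inverses(xs) == prod(reversed(xs))
    for xs in product(S3, repeat=3)
))
```

This uses only the existence of inverses, so it would not be available for a monoid quantifier.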



Since the above lemma holds only for $\mathcal{R} > 3s\Delta$, our $\mathcal{R}_\phi$ should be greater
than $3s\Delta$.

5.4 Constructing the active domain formula 

We know that for every $b \in B$ there is a function $f \in F$ and $d_1, \dots, d_{s'} \in \mathrm{nnp}(w)$
such that $b = f(d, a)$, where $a$ is the fixed assignment to the variables $y$. We
will use this encoding of a position and define a formula $\nu_{k,f}^m$ such that

$$w, d, a \models \nu_{k,f}^m(x, y) \Leftrightarrow N_k(f(d, a)) = m$$

Similarly we define formulas $\hat\nu_{k,f}^m$ such that $w, d, a \models \hat\nu_{k,f}^m(x, y)$ iff $\hat N_k(f(d, a)) =
m$.

We show this by induction over $k \le |T|$, starting with the base case $k = 0$.

Lemma 30. Let $a \in \mathbb{N}^r$. For each $m \in G$, there is an active domain formula
$\nu_{0,f}^m(x, y)$ in $\mathcal{L}[<,+]$, such that if $w, d \models \nu_{0,f}^m(x, a)$ then $N_0(f(d, a)) = m$.
Similarly there is an active domain formula $\hat\nu_{0,f}^m(x, y)$ in $\mathcal{L}[<,+]$ such that if
$w, d \models \hat\nu_{0,f}^m(x, a)$ then $\hat N_0(f(d, a)) = m$.

Proof. For an $i \le K$, we denote by $\phi^{m_i}$ the formula $\bigwedge_{j<i} \neg\phi_j(x, y) \wedge \phi_i(x, y)$.

For an $l \in \mathbb{N}$, the following formula checks if there is a point $b'$ in $B$ such that
$b + l = b'$. Since in each $B$ there is at most one such element, we can use the
group quantifier to simulate the existential quantifier.

$$\delta_f^l = \bigvee_{f' \in F} Q_G^{m_1} x'\,\big(f'(x', y) = f(x, y) + l,\ \text{false}, \dots, \text{false}\big)$$

So we have that $\mathrm{IR}(b) = l$ iff $\delta_f^{l+1} \wedge \bigwedge_{1 \le l' \le l} \neg\delta_f^{l'}$ is true. We define $\pi_f^{m,l}$ to be true
if the product of the first $l$ group elements is $m$.

g ...gi=m ^ i=0 ' 

Now we have two cases to consider.

Case $\mathrm{IR}(b) < p$: For each case $b < b'$ such that $l = b' - b < p$, the formula
$\pi_f^{m,l}$ computes the product of the group elements. Hence $\nu_{0,f}^m$ in this case can be
given as:

$$\bigwedge_{l=0}^{p-1} \Big(\big(\delta_f^{l+1}(x, y) \wedge \bigwedge_{l'=1}^{l} \neg\delta_f^{l'}(x, y)\big) \Rightarrow \pi_f^{m,l}(x, y)\Big)$$

Case $\mathrm{IR}(b) \ge p$: When $b' - b > p$ we have to compute the product for the first
$r$ group elements, where $b + r \equiv_p 0$ and $r < p$. Therefore $\nu_{0,f}^m$ in this case is

$$\bigwedge_{r=0}^{p-1} \Big(\big(f(x, y) + r \equiv_p 0\big) \Rightarrow \pi_f^{m,r}(x, y)\Big)$$

A Boolean combination over the $\delta_f^l$ can differentiate the two cases. Similarly we can
give active domain formulas $\hat\nu_{0,f}^m(x, y)$. □



The induction step follows.

Lemma 31. Let $a \in \mathbb{N}^r$. For each $m \in G$, there is an active domain formula
$\nu_{k,f}^m$ in $\mathcal{L}[<,+]$, such that if $w, d, a \models \nu_{k,f}^m(x, y)$ then $N_k(f(d, a)) = m$.
Similarly there is an active domain formula $\hat\nu_{k,f}^m(x, y)$ in $\mathcal{L}[<,+]$, such that if
$w, d, a \models \hat\nu_{k,f}^m(x, y)$ then $\hat N_k(f(d, a)) = m$.

Proof. For all $m \in G$ and $f' \in F_{t_k}$ we give formulas $\gamma_{f'}^m$ such that for all
$d' \in \mathrm{nnp}(w)^s$ the following holds. Let $f'(d', a) = b'$ and $f(d, a) = b$. Then
$w, d, d', a \models \gamma_{f'}^m(x, z, y)$ iff either $b' \le b$ and $m = 1_G$, or $b' > b$ and
$\big(\hat N_{k-1}(b')\big)^{-1} u(b')\, N_{k-1}(b') = m$. By induction hypothesis there exist formulas
$\nu_{k-1,f'}^m$ and $\hat\nu_{k-1,f'}^m$, which correspond to $N_{k-1}(f'(x, y))$ and $\hat N_{k-1}(f'(x, y))$ respectively.
Taking a Boolean combination over these formulas we get the required formula $\gamma_{f'}^m$.
We now apply our Tree Lemma (Lemma 29), which gives us formulas $\Gamma^m$, for all $m \in G$,
such that

$$w, d, a \models \Gamma^m(x, y) \Leftrightarrow \prod_{b' > b} \big(\hat N_{k-1}(b')\big)^{-1} u(b')\, N_{k-1}(b') = m$$

Taking a Boolean combination over $\Gamma^m$ and $\nu_{k-1,f}^m$ will give us the formula $\nu_{k,f}^m$.
Similarly we can build active domain formulas $\hat\nu_{k,f}^m(x, y)$, for all $m \in G$. □

Proof (of Lemma 15). By Lemma 25 we know that it suffices to compute
$N_{|T|}(0)$, and by Lemma 31 we know that there are active domain formulas $\Gamma^m \in
\mathcal{L}[<,+]$ such that $N_{|T|}(0) = m$ iff $w, 0, a \models \Gamma^m$.

We need to do one last thing: check that the infinite interval evaluates to
$1_G$. Replace all formulas $z > \rho$, $z < \rho$, $c(z)$ for a $c \ne \lambda$, and $\lambda(z)$ by true, false,
false, true respectively in the formulas $\phi_i$ and call these formulas $\psi_i$. There
exists a witness in the infinite interval for the formula $\phi_i$ iff $\psi_i$ evaluates to true.
By Lemma 23 there should not be any witness in the infinite interval. Hence
there exists a $\psi_i$ which evaluates to true iff the infinite interval does not evaluate
to $1_G$. □



6 Discussion 

We have shown that in the presence of a neutral letter the addition relation
collapses to the linear ordering no matter which monoid quantifiers are used. All
languages definable using monoid quantifiers and an order predicate, on the other
hand, are regular [3]. Using semigroup theoretic methods we can separate
these classes [26]. This enabled us to show separations between various logics
which use addition and order predicates.

Unfortunately, if both addition and multiplication are present, then the
collapse does not happen. It is also interesting to note that non-solvable groups
do not show any surprising property if only addition is present, but as we know
from Barrington's theorem non-solvable groups behave quite differently when
both addition and multiplication are present.

The ultimate objective is to show non-expressibility results for arbitrary pred- 
icates or at least when both addition and multiplication are present. As a first 
step one can look at extending these results for other kinds of predicates. 

Another way to look at separating the "natural uniform" versions of the
complexity classes is to ask whether one can come up with other suitable
restrictions on the set of languages. Inside this restricted set of languages, can one
show that addition and multiplication collapse to the order relation? This seems to be
the idea Straubing considers in [27]. Straubing [26] proposes word problems over
regular languages as a suitable restriction, while McKenzie, Thomas and Vollmer
[20] consider context-free languages as a restriction.

Another interesting question which our result fails to answer is whether word
problems over non-solvable groups can be defined in MAJ[<,+] [13].